Overview

Dataset info

Number of variables29
Number of observations4441
Missing cells0 (0.0%)
Duplicate rows0 (0.0%)
Total size in memory4.6 MiB
Average record size in memory1.1 KiB

Variables types

NUM17
CAT11
URL1

Reproduction info

Date of analysis2020-01-17 12:40:30.800175
Versionpandas-profiling v2.4.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download Configurationconfig.yaml

Warnings

actor_1_facebook_likes is highly skewed (γ1 = 20.39377482) Skewed
actor_1_name has a high cardinality: 1807 distinct values Warning
actor_2_name has a high cardinality: 2673 distinct values Warning
actor_3_facebook_likes has 52 (1.2%) zeros Zeros
actor_3_name has a high cardinality: 3156 distinct values Warning
budget is highly skewed (γ1 = 25.93138816) Skewed
country has a high cardinality: 54 distinct values Warning
director_facebook_likes has 753 (17.0%) zeros Zeros
director_name has a high cardinality: 2100 distinct values Warning
facenumber_in_poster has 1890 (42.6%) zeros Zeros
genres has a high cardinality: 851 distinct values Warning
movie_facebook_likes has 2024 (45.6%) zeros Zeros
movie_title has a high cardinality: 4441 distinct values Warning
plot_keywords has a high cardinality: 4437 distinct values Warning
cast_total_facebook_likes is highly correlated with actor_1_facebook_likesHigh Correlation
actor_1_facebook_likes is highly correlated with cast_total_facebook_likesHigh Correlation

Variables

actor_1_facebook_likes
Real number (ℝ≥0)

SKEWED
HIGH CORRELATION
Distinct count819
Unique (%)18.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6910.11011
Minimum0
Maximum640000
Zeros9
Zeros (%)0.2%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile144
Q1660
median1000
Q312000
95-th percentile24000
Maximum640000
Range640000
Interquartile range (IQR)11340

Descriptive statistics

Standard deviation14779.6342
Coefficient of variation (CV)2.138842069
Kurtosis786.7349524
Mean6910.11011
Median Absolute Deviation (MAD)7864.497057
Skewness20.39377482
Sum30687799
Variance218437587.2
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000e+00 1.000e+00 3.715e+02 5.755e+02 8.255e+02 ... 3.750e+04 4.200e+04 6.300e+04 1.505e+05 6.400e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1000 404 9.1%
 
11000 204 4.6%
 
2000 179 4.0%
 
3000 141 3.2%
 
12000 131 2.9%
 
13000 121 2.7%
 
14000 119 2.7%
 
18000 105 2.4%
 
10000 104 2.3%
 
22000 77 1.7%
 
Other values (809) 2856 64.3%
 
ValueCountFrequency (%) 
0 9 0.2%
 
2 5 0.1%
 
3 2 < 0.1%
 
4 1 < 0.1%
 
5 3 0.1%
 
ValueCountFrequency (%) 
640000 1 < 0.1%
 
260000 1 < 0.1%
 
164000 2 < 0.1%
 
137000 2 < 0.1%
 
87000 8 0.2%
 

actor_1_name
Categorical

HIGH CARDINALITY
Distinct count1807
Unique (%)40.7%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
Robert De Niro
 
47
Johnny Depp
 
36
Nicolas Cage
 
32
J.K. Simmons
 
29
Denzel Washington
 
29
Other values (1802)
4268
ValueCountFrequency (%) 
Robert De Niro 47 1.1%
 
Johnny Depp 36 0.8%
 
Nicolas Cage 32 0.7%
 
J.K. Simmons 29 0.7%
 
Denzel Washington 29 0.7%
 
Matt Damon 28 0.6%
 
Bruce Willis 28 0.6%
 
Steve Buscemi 27 0.6%
 
Harrison Ford 27 0.6%
 
Liam Neeson 26 0.6%
 
Other values (1797) 4132 93.0%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length27
Mean length13.19229903
Min length4
Scatter

actor_2_facebook_likes
Real number (ℝ≥0)

Distinct count897
Unique (%)20.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1748.402387
Minimum0
Maximum137000
Zeros24
Zeros (%)0.5%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile44
Q1321
median626
Q3936
95-th percentile11000
Maximum137000
Range137000
Interquartile range (IQR)615

Descriptive statistics

Standard deviation4178.221576
Coefficient of variation (CV)2.389736829
Kurtosis253.2704876
Mean1748.402387
Median Absolute Deviation (MAD)2089.040018
Skewness9.904953976
Sum7764655
Variance17457535.53
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000e+00 1.000e+00 2.950e+01 3.235e+02 3.295e+02 ... 1.150e+04 1.450e+04 2.250e+04 2.800e+04 1.370e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1000 288 6.5%
 
11000 105 2.4%
 
2000 93 2.1%
 
3000 72 1.6%
 
10000 45 1.0%
 
13000 39 0.9%
 
14000 38 0.9%
 
826 35 0.8%
 
4000 32 0.7%
 
12000 29 0.7%
 
Other values (887) 3665 82.5%
 
ValueCountFrequency (%) 
0 24 0.5%
 
2 8 0.2%
 
3 7 0.2%
 
4 6 0.1%
 
5 8 0.2%
 
ValueCountFrequency (%) 
137000 1 < 0.1%
 
29000 1 < 0.1%
 
27000 2 < 0.1%
 
25000 2 < 0.1%
 
23000 6 0.1%
 

actor_2_name
Categorical

HIGH CARDINALITY
Distinct count2673
Unique (%)60.2%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
Morgan Freeman
 
18
Charlize Theron
 
14
Brad Pitt
 
13
Meryl Streep
 
11
Adam Sandler
 
10
Other values (2668)
4375
ValueCountFrequency (%) 
Morgan Freeman 18 0.4%
 
Charlize Theron 14 0.3%
 
Brad Pitt 13 0.3%
 
Meryl Streep 11 0.2%
 
Adam Sandler 10 0.2%
 
James Franco 10 0.2%
 
Bruce Willis 9 0.2%
 
Scott Glenn 9 0.2%
 
Will Ferrell 9 0.2%
 
Kirsten Dunst 8 0.2%
 
Other values (2663) 4330 97.5%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length28
Mean length13.06282369
Min length3
Scatter

actor_3_facebook_likes
Real number (ℝ≥0)

ZEROS
Distinct count901
Unique (%)20.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean672.8203107
Minimum0
Maximum23000
Zeros52
Zeros (%)1.2%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile16
Q1158
median391
Q3650
95-th percentile1000
Maximum23000
Range23000
Interquartile range (IQR)492

Descriptive statistics

Standard deviation1699.183083
Coefficient of variation (CV)2.525463421
Kurtosis57.79872839
Mean672.8203107
Median Absolute Deviation (MAD)584.2156617
Skewness7.112386817
Sum2987995
Variance2887223.151
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000e+00 1.000e+00 1.850e+01 1.075e+02 2.875e+02 ... 4.500e+03 9.500e+03 1.150e+04 1.450e+04 2.300e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1000 115 2.6%
 
0 52 1.2%
 
11000 27 0.6%
 
2000 25 0.6%
 
3000 24 0.5%
 
826 21 0.5%
 
249 19 0.4%
 
7 17 0.4%
 
51 16 0.4%
 
3 16 0.4%
 
Other values (891) 4109 92.5%
 
ValueCountFrequency (%) 
0 52 1.2%
 
2 13 0.3%
 
3 16 0.4%
 
4 15 0.3%
 
5 8 0.2%
 
ValueCountFrequency (%) 
23000 2 < 0.1%
 
20000 1 < 0.1%
 
19000 4 0.1%
 
17000 1 < 0.1%
 
16000 3 0.1%
 

actor_3_name
Categorical

HIGH CARDINALITY
Distinct count3156
Unique (%)71.1%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
Steve Coogan
 
8
Robert Duvall
 
7
Sam Shepard
 
7
Ben Mendelsohn
 
7
Stephen Root
 
7
Other values (3151)
4405
ValueCountFrequency (%) 
Steve Coogan 8 0.2%
 
Robert Duvall 7 0.2%
 
Sam Shepard 7 0.2%
 
Ben Mendelsohn 7 0.2%
 
Stephen Root 7 0.2%
 
Jon Gries 6 0.1%
 
John Gielgud 6 0.1%
 
Lois Maxwell 6 0.1%
 
Thomas Lennon 6 0.1%
 
Bruce McGill 6 0.1%
 
Other values (3146) 4375 98.5%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length27
Mean length13.0565188
Min length3
Scatter

aspect_ratio
Real number (ℝ≥0)

Distinct count20
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.104130384
Minimum1.18
Maximum16
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum1.18
5-th percentile1.78
Q11.85
median2.2
Q32.35
95-th percentile2.35
Maximum16
Range14.82
Interquartile range (IQR)0.5

Descriptive statistics

Standard deviation0.5009573553
Coefficient of variation (CV)0.2380828484
Kurtosis531.2858039
Mean2.104130384
Median Absolute Deviation (MAD)0.2715259508
Skewness19.13615632
Sum9344.443036
Variance0.2509582718
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 1.18 1.265 1.35 1.435 1.58 ... 2.295 2.37 2.395 2.655 16. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2.35 2189 49.3%
 
1.85 1808 40.7%
 
2.104130384 146 3.3%
 
1.37 92 2.1%
 
1.78 63 1.4%
 
1.66 60 1.4%
 
1.33 31 0.7%
 
2.39 14 0.3%
 
2.2 13 0.3%
 
16 4 0.1%
 
Other values (10) 21 0.5%
 
ValueCountFrequency (%) 
1.18 1 < 0.1%
 
1.2 1 < 0.1%
 
1.33 31 0.7%
 
1.37 92 2.1%
 
1.5 2 < 0.1%
 
ValueCountFrequency (%) 
16 4 0.1%
 
2.76 3 0.1%
 
2.55 2 < 0.1%
 
2.4 3 0.1%
 
2.39 14 0.3%
 

budget
Real number (ℝ≥0)

SKEWED
Distinct count404
Unique (%)9.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38284338.29
Minimum218
Maximum4200000000
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum218
5-th percentile1000000
Q18000000
median24000000
Q342000000
95-th percentile125000000
Maximum4200000000
Range4199999782
Interquartile range (IQR)34000000

Descriptive statistics

Standard deviation99322701.58
Coefficient of variation (CV)2.594342909
Kurtosis899.8248003
Mean38284338.29
Median Absolute Deviation (MAD)31683166.12
Skewness25.93138816
Sum1.700207463e+11
Variance9.864999048e+15
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[2.1800e+02 3.0600e+05 4.8750e+05 5.2500e+05 9.8000e+05 ... 1.9750e+08 2.0350e+08 2.6185e+08 8.5000e+08 4.2000e+09], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
36541497.42 309 7.0%
 
20000000 166 3.7%
 
30000000 134 3.0%
 
25000000 133 3.0%
 
15000000 133 3.0%
 
40000000 128 2.9%
 
10000000 126 2.8%
 
35000000 117 2.6%
 
50000000 98 2.2%
 
5000000 97 2.2%
 
Other values (394) 3000 67.6%
 
ValueCountFrequency (%) 
218 1 < 0.1%
 
1100 1 < 0.1%
 
4500 1 < 0.1%
 
7000 3 0.1%
 
9000 1 < 0.1%
 
ValueCountFrequency (%) 
4200000000 1 < 0.1%
 
2500000000 1 < 0.1%
 
2400000000 1 < 0.1%
 
2127519898 1 < 0.1%
 
1100000000 1 < 0.1%
 

cast_total_facebook_likes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count3722
Unique (%)83.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10224.04346
Minimum0
Maximum656730
Zeros9
Zeros (%)0.2%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile313
Q11585
median3352
Q314672
95-th percentile37645
Maximum656730
Range656730
Interquartile range (IQR)13087

Descriptive statistics

Standard deviation18035.4089
Coefficient of variation (CV)1.764019194
Kurtosis400.1457367
Mean10224.04346
Median Absolute Deviation (MAD)10335.84546
Skewness13.32508768
Sum45404977
Variance325275974.2
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.00000e+00 1.00000e+00 2.76450e+03 3.31100e+03 3.98700e+03 ... 5.40790e+04 6.46985e+04 9.22280e+04 1.55193e+05 6.56730e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 9 0.2%
 
29 5 0.1%
 
2020 5 0.1%
 
1044 5 0.1%
 
1227 4 0.1%
 
2251 4 0.1%
 
646 4 0.1%
 
2 4 0.1%
 
1761 4 0.1%
 
2321 4 0.1%
 
Other values (3712) 4393 98.9%
 
ValueCountFrequency (%) 
0 9 0.2%
 
2 4 0.1%
 
4 1 < 0.1%
 
5 3 0.1%
 
6 2 < 0.1%
 
ValueCountFrequency (%) 
656730 1 < 0.1%
 
303717 1 < 0.1%
 
263584 1 < 0.1%
 
170118 1 < 0.1%
 
140268 1 < 0.1%
 

color
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
Color
4257
Black and White
 
184
ValueCountFrequency (%) 
Color 4257 95.9%
 
Black and White 184 4.1%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length16
Mean length5.455753209
Min length5
Scatter

content_rating
Categorical

Distinct count15
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
R
2027
PG-13
1384
PG
666
G
 
109
Not Rated
 
99
Other values (10)
 
156
ValueCountFrequency (%) 
R 2027 45.6%
 
PG-13 1384 31.2%
 
PG 666 15.0%
 
G 109 2.5%
 
Not Rated 99 2.2%
 
Unrated 56 1.3%
 
Approved 54 1.2%
 
X 12 0.3%
 
Passed 9 0.2%
 
NC-17 7 0.2%
 
Other values (5) 18 0.4%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length9
Mean length2.759063274
Min length1
Scatter

country
Categorical

HIGH CARDINALITY
Distinct count54
Unique (%)1.2%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
USA
3422
UK
 
398
France
 
130
Canada
 
100
Germany
 
90
Other values (49)
 
301
ValueCountFrequency (%) 
USA 3422 77.1%
 
UK 398 9.0%
 
France 130 2.9%
 
Canada 100 2.3%
 
Germany 90 2.0%
 
Australia 49 1.1%
 
Spain 31 0.7%
 
Japan 18 0.4%
 
Italy 17 0.4%
 
China 17 0.4%
 
Other values (44) 169 3.8%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length14
Mean length3.438639946
Min length2
Scatter

df_index
Real number (ℝ≥0)

UNIQUE
Distinct count4441
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2374.020941
Minimum0
Maximum5042
Zeros1
Zeros (%)< 0.1%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile231
Q11155
median2327
Q33564
95-th percentile4674
Maximum5042
Range5042
Interquartile range (IQR)2409

Descriptive statistics

Standard deviation1412.260583
Coefficient of variation (CV)0.5948812661
Kurtosis-1.144780233
Mean2374.020941
Median Absolute Deviation (MAD)1216.970649
Skewness0.09422882027
Sum10543027
Variance1994479.955
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 4106.5 5042. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
2604 1 < 0.1%
 
4643 1 < 0.1%
 
2596 1 < 0.1%
 
549 1 < 0.1%
 
4647 1 < 0.1%
 
2600 1 < 0.1%
 
553 1 < 0.1%
 
557 1 < 0.1%
 
2616 1 < 0.1%
 
Other values (4431) 4431 99.8%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
5 1 < 0.1%
 
ValueCountFrequency (%) 
5042 1 < 0.1%
 
5037 1 < 0.1%
 
5035 1 < 0.1%
 
5034 1 < 0.1%
 
5033 1 < 0.1%
 

director_facebook_likes
Real number (ℝ≥0)

ZEROS
Distinct count424
Unique (%)9.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean739.654132
Minimum0
Maximum23000
Zeros753
Zeros (%)17.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q19
median54
Q3212
95-th percentile1000
Maximum23000
Range23000
Interquartile range (IQR)203

Descriptive statistics

Standard deviation2928.865929
Coefficient of variation (CV)3.959777688
Kurtosis24.6824728
Mean739.654132
Median Absolute Deviation (MAD)1154.941447
Skewness4.993370888
Sum3284804
Variance8578255.629
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.00e+00 1.00e+00 1.35e+01 2.55e+01 4.45e+01 ... 1.25e+04 1.45e+04 1.55e+04 1.75e+04 2.30e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 753 17.0%
 
6 57 1.3%
 
11 52 1.2%
 
7 52 1.2%
 
3 51 1.1%
 
2 50 1.1%
 
4 48 1.1%
 
10 47 1.1%
 
9 46 1.0%
 
13 45 1.0%
 
Other values (414) 3240 73.0%
 
ValueCountFrequency (%) 
0 753 17.0%
 
2 50 1.1%
 
3 51 1.1%
 
4 48 1.1%
 
5 42 0.9%
 
ValueCountFrequency (%) 
23000 1 < 0.1%
 
22000 8 0.2%
 
21000 10 0.2%
 
18000 4 0.1%
 
17000 20 0.5%
 

director_name
Categorical

HIGH CARDINALITY
Distinct count2100
Unique (%)47.3%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
Steven Spielberg
 
26
Woody Allen
 
22
Martin Scorsese
 
20
Clint Eastwood
 
20
Ridley Scott
 
16
Other values (2095)
4337
ValueCountFrequency (%) 
Steven Spielberg 26 0.6%
 
Woody Allen 22 0.5%
 
Martin Scorsese 20 0.5%
 
Clint Eastwood 20 0.5%
 
Ridley Scott 16 0.4%
 
Renny Harlin 15 0.3%
 
Steven Soderbergh 15 0.3%
 
Spike Lee 15 0.3%
 
Tim Burton 14 0.3%
 
Oliver Stone 14 0.3%
 
Other values (2090) 4264 96.0%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length32
Mean length13.0689034
Min length3
Scatter

duration
Real number (ℝ≥0)

Distinct count157
Unique (%)3.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean108.81288
Minimum20
Maximum330
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum20
5-th percentile84
Q194
median104
Q3119
95-th percentile146
Maximum330
Range310
Interquartile range (IQR)25

Descriptive statistics

Standard deviation22.32140079
Coefficient of variation (CV)0.2051356493
Kurtosis12.19301058
Mean108.81288
Median Absolute Deviation (MAD)15.77887457
Skewness2.313159844
Sum483238
Variance498.2449332
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 20. 64. 73.5 79.5 85.5 ... 142.5 154.5 179. 226.5 330. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
90 133 3.0%
 
100 127 2.9%
 
98 125 2.8%
 
101 122 2.7%
 
93 116 2.6%
 
99 115 2.6%
 
97 115 2.6%
 
94 111 2.5%
 
95 111 2.5%
 
107 102 2.3%
 
Other values (147) 3264 73.5%
 
ValueCountFrequency (%) 
20 1 < 0.1%
 
25 1 < 0.1%
 
37 1 < 0.1%
 
45 1 < 0.1%
 
53 1 < 0.1%
 
ValueCountFrequency (%) 
330 1 < 0.1%
 
325 1 < 0.1%
 
300 1 < 0.1%
 
293 1 < 0.1%
 
289 1 < 0.1%
 

facenumber_in_poster
Real number (ℝ≥0)

ZEROS
Distinct count19
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.363882009
Minimum0
Maximum43
Zeros1890
Zeros (%)42.6%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile5
Maximum43
Range43
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.012683133
Coefficient of variation (CV)1.475701799
Kurtosis57.95438239
Mean1.363882009
Median Absolute Deviation (MAD)1.344092358
Skewness4.65478897
Sum6057
Variance4.050893395
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 6.5 8.5 10.5 17. 43. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1890 42.6%
 
1 1118 25.2%
 
2 639 14.4%
 
3 339 7.6%
 
4 181 4.1%
 
5 92 2.1%
 
6 64 1.4%
 
7 43 1.0%
 
8 34 0.8%
 
9 13 0.3%
 
Other values (9) 28 0.6%
 
ValueCountFrequency (%) 
0 1890 42.6%
 
1 1118 25.2%
 
2 639 14.4%
 
3 339 7.6%
 
4 181 4.1%
 
ValueCountFrequency (%) 
43 1 < 0.1%
 
31 1 < 0.1%
 
19 1 < 0.1%
 
15 4 0.1%
 
14 1 < 0.1%
 

genres
Categorical

HIGH CARDINALITY
Distinct count851
Unique (%)19.2%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
Drama
 
195
Comedy
 
177
Comedy|Drama|Romance
 
174
Comedy|Drama
 
167
Drama|Romance
 
143
Other values (846)
3585
ValueCountFrequency (%) 
Drama 195 4.4%
 
Comedy 177 4.0%
 
Comedy|Drama|Romance 174 3.9%
 
Comedy|Drama 167 3.8%
 
Drama|Romance 143 3.2%
 
Comedy|Romance 142 3.2%
 
Crime|Drama|Thriller 89 2.0%
 
Action|Crime|Thriller 60 1.4%
 
Action|Crime|Drama|Thriller 59 1.3%
 
Horror 57 1.3%
 
Other values (841) 3178 71.6%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length64
Mean length20.63476694
Min length5
Scatter

gross
Real number (ℝ≥0)

Distinct count3925
Unique (%)88.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48652043.17
Minimum162
Maximum760505847
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum162
5-th percentile146402
Q17825820
median33000377
Q355942830
95-th percentile170708996
Maximum760505847
Range760505685
Interquartile range (IQR)48117010

Descriptive statistics

Standard deviation63839429.28
Coefficient of variation (CV)1.312163377
Kurtosis17.02826811
Mean48652043.17
Median Absolute Deviation (MAD)39847568.2
Skewness3.30212485
Sum2.160637237e+11
Variance4.075472731e+15
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1.62000000e+02 2.72345000e+04 1.46242500e+05 5.41472500e+05 1.34002150e+06 ... 1.84567166e+08 2.62000639e+08 3.38791386e+08 4.67740171e+08 7.60505847e+08], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
47644514.53 499 11.2%
 
8000000 3 0.1%
 
78900000 2 < 0.1%
 
36000000 2 < 0.1%
 
2000000 2 < 0.1%
 
30400000 2 < 0.1%
 
26400000 2 < 0.1%
 
1000000 2 < 0.1%
 
800000 2 < 0.1%
 
25000000 2 < 0.1%
 
Other values (3915) 3923 88.3%
 
ValueCountFrequency (%) 
162 1 < 0.1%
 
703 1 < 0.1%
 
721 1 < 0.1%
 
828 1 < 0.1%
 
1111 1 < 0.1%
 
ValueCountFrequency (%) 
760505847 1 < 0.1%
 
658672302 1 < 0.1%
 
652177271 1 < 0.1%
 
623279547 1 < 0.1%
 
533316061 1 < 0.1%
 

imdb_score
Real number (ℝ≥0)

Distinct count76
Unique (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.432785409
Minimum1.6
Maximum9.3
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum1.6
5-th percentile4.4
Q15.8
median6.6
Q37.2
95-th percentile8
Maximum9.3
Range7.7
Interquartile range (IQR)1.4

Descriptive statistics

Standard deviation1.099525537
Coefficient of variation (CV)0.1709252629
Kurtosis1.1148243
Mean6.432785409
Median Absolute Deviation (MAD)0.8485388831
Skewness-0.7731944673
Sum28568
Variance1.208956406
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1.6 2.65 3.25 3.95 4.55 ... 7.85 8.15 8.55 8.95 9.3 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6.7 205 4.6%
 
6.6 184 4.1%
 
6.5 175 3.9%
 
6.4 174 3.9%
 
6.8 170 3.8%
 
7.2 168 3.8%
 
7.1 164 3.7%
 
6.1 162 3.6%
 
7.3 160 3.6%
 
6.3 158 3.6%
 
Other values (66) 2721 61.3%
 
ValueCountFrequency (%) 
1.6 1 < 0.1%
 
1.7 1 < 0.1%
 
1.9 3 0.1%
 
2 2 < 0.1%
 
2.1 3 0.1%
 
ValueCountFrequency (%) 
9.3 1 < 0.1%
 
9.2 1 < 0.1%
 
9 2 < 0.1%
 
8.9 5 0.1%
 
8.8 5 0.1%
 

language
Categorical

Distinct count37
Unique (%)0.8%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
English
4213
French
 
50
Spanish
 
33
Mandarin
 
19
German
 
14
Other values (32)
 
112
ValueCountFrequency (%) 
English 4213 94.9%
 
French 50 1.1%
 
Spanish 33 0.7%
 
Mandarin 19 0.4%
 
German 14 0.3%
 
Hindi 14 0.3%
 
Japanese 13 0.3%
 
Portuguese 8 0.2%
 
Cantonese 8 0.2%
 
Italian 8 0.2%
 
Other values (27) 61 1.4%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length10
Mean length6.988966449
Min length4
Scatter

movie_facebook_likes
Real number (ℝ≥0)

ZEROS
Distinct count798
Unique (%)18.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7917.134204
Minimum0
Maximum349000
Zeros2024
Zeros (%)45.6%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median177
Q35000
95-th percentile42000
Maximum349000
Range349000
Interquartile range (IQR)5000

Descriptive statistics

Standard deviation19864.23539
Coefficient of variation (CV)2.509018399
Kurtosis40.12413577
Mean7917.134204
Median Absolute Deviation (MAD)11449.1077
Skewness4.970180909
Sum35159993
Variance394587847.5
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000e+00 2.000e+00 9.995e+02 1.500e+03 3.500e+03 ... 6.550e+04 8.400e+04 1.515e+05 1.980e+05 3.490e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 2024 45.6%
 
1000 101 2.3%
 
11000 76 1.7%
 
10000 72 1.6%
 
13000 58 1.3%
 
12000 56 1.3%
 
2000 51 1.1%
 
15000 47 1.1%
 
16000 45 1.0%
 
14000 44 1.0%
 
Other values (788) 1867 42.0%
 
ValueCountFrequency (%) 
0 2024 45.6%
 
4 1 < 0.1%
 
7 2 < 0.1%
 
10 1 < 0.1%
 
12 2 < 0.1%
 
ValueCountFrequency (%) 
349000 1 < 0.1%
 
199000 1 < 0.1%
 
197000 1 < 0.1%
 
191000 1 < 0.1%
 
190000 1 < 0.1%
 
Distinct count4441
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
http://www.imdb.com/title/tt1701990/?ref_=fn_tt_tt_1
 
1
http://www.imdb.com/title/tt0119494/?ref_=fn_tt_tt_1
 
1
http://www.imdb.com/title/tt0300214/?ref_=fn_tt_tt_1
 
1
http://www.imdb.com/title/tt0387808/?ref_=fn_tt_tt_1
 
1
http://www.imdb.com/title/tt0790724/?ref_=fn_tt_tt_1
 
1
Other values (4436)
4436
ValueCountFrequency (%) 
http://www.imdb.com/title/tt1701990/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt0119494/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt0300214/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt0387808/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt0790724/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt0104431/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt0102138/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt1661382/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt2103267/?ref_=fn_tt_tt_1 1 < 0.1%
 
http://www.imdb.com/title/tt0286244/?ref_=fn_tt_tt_1 1 < 0.1%
 
Other values (4431) 4431 99.8%
 
ValueCountFrequency (%) 
http 4441 100.0%
 
ValueCountFrequency (%) 
www.imdb.com 4441 100.0%
 
ValueCountFrequency (%) 
/title/tt0166195/ 1 < 0.1%
 
/title/tt0365885/ 1 < 0.1%
 
/title/tt2387559/ 1 < 0.1%
 
/title/tt1542344/ 1 < 0.1%
 
/title/tt0099422/ 1 < 0.1%
 
/title/tt2183034/ 1 < 0.1%
 
/title/tt0261392/ 1 < 0.1%
 
/title/tt1034331/ 1 < 0.1%
 
/title/tt0790736/ 1 < 0.1%
 
/title/tt0156323/ 1 < 0.1%
 
Other values (4431) 4431 99.8%
 
ValueCountFrequency (%) 
ref_=fn_tt_tt_1 4441 100.0%
 
ValueCountFrequency (%) 
4441 100.0%
 

movie_title
Categorical

UNIQUE
HIGH CARDINALITY
Distinct count4441
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
Donkey Punch 
 
1
Strangerland 
 
1
Capote 
 
1
Vera Drake 
 
1
Hotel for Dogs 
 
1
Other values (4436)
4436
ValueCountFrequency (%) 
Donkey Punch  1 < 0.1%
 
Strangerland  1 < 0.1%
 
Capote  1 < 0.1%
 
Vera Drake  1 < 0.1%
 
Hotel for Dogs  1 < 0.1%
 
A Madea Christmas  1 < 0.1%
 
Paddington  1 < 0.1%
 
Seed of Chucky  1 < 0.1%
 
Mrs Henderson Presents  1 < 0.1%
 
Highlander  1 < 0.1%
 
Other values (4431) 4431 99.8%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length87
Mean length16.28124296
Min length2
Scatter

num_critic_for_reviews
Real number (ℝ≥0)

Distinct count527
Unique (%)11.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean148.6437739
Minimum1
Maximum813
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum1
5-th percentile17
Q162
median120
Q3202
95-th percentile391
Maximum813
Range812
Interquartile range (IQR)140

Descriptive statistics

Standard deviation119.6388771
Coefficient of variation (CV)0.8048697496
Kurtosis2.990517209
Mean148.6437739
Median Absolute Deviation (MAD)90.30978445
Skewness1.52534826
Sum660127
Variance14313.46091
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 1. 15.5 98.5 169.5 233.5 292.5 377.5 492.5 607. 813. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
81 30 0.7%
 
112 28 0.6%
 
97 28 0.6%
 
43 27 0.6%
 
25 27 0.6%
 
64 27 0.6%
 
50 26 0.6%
 
63 26 0.6%
 
29 26 0.6%
 
61 26 0.6%
 
Other values (517) 4170 93.9%
 
ValueCountFrequency (%) 
1 8 0.2%
 
2 12 0.3%
 
3 7 0.2%
 
4 9 0.2%
 
5 15 0.3%
 
ValueCountFrequency (%) 
813 1 < 0.1%
 
775 1 < 0.1%
 
765 1 < 0.1%
 
750 1 < 0.1%
 
739 1 < 0.1%
 

num_user_for_reviews
Real number (ℝ≥0)

Distinct count953
Unique (%)21.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean290.4807476
Minimum1
Maximum5060
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum1
5-th percentile22
Q183
median174
Q3345
95-th percentile928
Maximum5060
Range5059
Interquartile range (IQR)262

Descriptive statistics

Standard deviation382.9447948
Coefficient of variation (CV)1.318313857
Kurtosis26.71061023
Mean290.4807476
Median Absolute Deviation (MAD)231.5473951
Skewness4.145034849
Sum1290025
Variance146646.7159
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1.0000e+00 9.5000e+00 1.5050e+02 2.1650e+02 2.9150e+02 ... 9.1850e+02 1.1955e+03 1.5310e+03 2.8085e+03 5.0600e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
26 26 0.6%
 
50 24 0.5%
 
39 21 0.5%
 
31 21 0.5%
 
53 21 0.5%
 
69 20 0.5%
 
90 20 0.5%
 
73 20 0.5%
 
32 20 0.5%
 
55 19 0.4%
 
Other values (943) 4229 95.2%
 
ValueCountFrequency (%) 
1 5 0.1%
 
2 3 0.1%
 
3 8 0.2%
 
4 5 0.1%
 
5 6 0.1%
 
ValueCountFrequency (%) 
5060 1 < 0.1%
 
4667 1 < 0.1%
 
4144 1 < 0.1%
 
3646 1 < 0.1%
 
3597 1 < 0.1%
 

num_voted_users
Real number (ℝ≥0)

Distinct count4353
Unique (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean90310.95632
Minimum28
Maximum1689764
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum28
5-th percentile1599
Q112324
median40346
Q3104301
95-th percentile351274
Maximum1689764
Range1689736
Interquartile range (IQR)91977

Descriptive statistics

Standard deviation142873.8382
Coefficient of variation (CV)1.582021097
Kurtosis23.23207544
Mean90310.95632
Median Absolute Deviation (MAD)87557.4915
Skewness3.935380646
Sum401070957
Variance2.041293365e+10
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[2.800000e+01 4.380000e+02 9.205000e+02 6.380500e+03 1.684750e+04 ... 2.222510e+05 3.340960e+05 5.374305e+05 8.992820e+05 1.689764e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3119 3 0.1%
 
3665 3 0.1%
 
2541 3 0.1%
 
922 2 < 0.1%
 
25332 2 < 0.1%
 
23023 2 < 0.1%
 
36108 2 < 0.1%
 
27882 2 < 0.1%
 
12980 2 < 0.1%
 
1231 2 < 0.1%
 
Other values (4343) 4418 99.5%
 
ValueCountFrequency (%) 
28 1 < 0.1%
 
48 1 < 0.1%
 
50 1 < 0.1%
 
53 2 < 0.1%
 
60 1 < 0.1%
 
ValueCountFrequency (%) 
1689764 1 < 0.1%
 
1676169 1 < 0.1%
 
1468200 1 < 0.1%
 
1347461 1 < 0.1%
 
1324680 1 < 0.1%
 

plot_keywords
Categorical

HIGH CARDINALITY
Distinct count4437
Unique (%)99.9%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
one word title
 
3
based on novel
 
3
paralympics|quad rugby|rugby|team|wheelchair
 
1
17th century|girl|maid|painter|painting
 
1
apprentice|demon|exorcism|master apprentice relationship|witch
 
1
Other values (4432)
4432
ValueCountFrequency (%) 
one word title 3 0.1%
 
based on novel 3 0.1%
 
paralympics|quad rugby|rugby|team|wheelchair 1 < 0.1%
 
17th century|girl|maid|painter|painting 1 < 0.1%
 
apprentice|demon|exorcism|master apprentice relationship|witch 1 < 0.1%
 
baby|desert island|island|sequel|teenage girl 1 < 0.1%
 
ejected from a moving vehicle|gun held to head|handcuffs|shot multiple times|strangulation 1 < 0.1%
 
abdication|china|emperor|forbidden city|republic 1 < 0.1%
 
cattle|cow|dairy farm|farm|rustler 1 < 0.1%
 
mutant|superhero|superhero team|x men|year 1983 1 < 0.1%
 
Other values (4427) 4427 99.7%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length149
Mean length52.49268183
Min length2
Scatter

title_year
Real number (ℝ≥0)

Distinct count88
Unique (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2001.929971
Minimum1927
Maximum2016
Zeros0
Zeros (%)0.0%
Memory size34.8 KiB
Mini histogram

Quantile statistics

Minimum1927
5-th percentile1978
Q11998
median2005
Q32010
95-th percentile2015
Maximum2016
Range89
Interquartile range (IQR)12

Descriptive statistics

Standard deviation12.34025148
Coefficient of variation (CV)0.006164177397
Kurtosis6.832351141
Mean2001.929971
Median Absolute Deviation (MAD)8.477429589
Skewness-2.213872958
Sum8890571
Variance152.2818065
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1927. 1960.5 1976.5 1979.5 1992.5 1995.5 1998.5 2003.5 2014.5 2016. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2009 229 5.2%
 
2006 220 5.0%
 
2008 214 4.8%
 
2010 210 4.7%
 
2011 210 4.7%
 
2005 203 4.6%
 
2004 197 4.4%
 
2002 196 4.4%
 
2013 192 4.3%
 
2012 191 4.3%
 
Other values (78) 2379 53.6%
 
ValueCountFrequency (%) 
1927 1 < 0.1%
 
1929 2 < 0.1%
 
1930 1 < 0.1%
 
1932 1 < 0.1%
 
1933 2 < 0.1%
 
ValueCountFrequency (%) 
2016 72 1.6%
 
2015 152 3.4%
 
2014 184 4.1%
 
2013 192 4.3%
 
2012 191 4.3%
 

Correlations

Missing values

Sample

First rows

actor_1_facebook_likesactor_1_nameactor_2_facebook_likesactor_2_nameactor_3_facebook_likesactor_3_nameaspect_ratiobudgetcast_total_facebook_likescolorcontent_ratingcountrydf_indexdirector_facebook_likesdirector_namedurationfacenumber_in_postergenresgrossimdb_scorelanguagemovie_facebook_likesmovie_imdb_linkmovie_titlenum_critic_for_reviewsnum_user_for_reviewsnum_voted_usersplot_keywordstitle_year
01000.0CCH Pounder936.0Joel David Moore855.0Wes Studi1.78237000000.04834ColorPG-13USA00.0James Cameron178.00.0Action|Adventure|Fantasy|Sci-Fi760505847.07.9English33000http://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_1Avatar723.03054.0886204avatar|future|marine|native|paraplegic2009.0
140000.0Johnny Depp5000.0Orlando Bloom1000.0Jack Davenport2.35300000000.048350ColorPG-13USA1563.0Gore Verbinski169.00.0Action|Adventure|Fantasy309404152.07.1English0http://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_1Pirates of the Caribbean: At World's End302.01238.0471220goddess|marriage ceremony|marriage proposal|pirate|singapore2007.0
211000.0Christoph Waltz393.0Rory Kinnear161.0Stephanie Sigman2.35245000000.011700ColorPG-13UK20.0Sam Mendes148.01.0Action|Adventure|Thriller200074175.06.8English85000http://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1Spectre602.0994.0275868bomb|espionage|sequel|spy|terrorist2015.0
327000.0Tom Hardy23000.0Christian Bale23000.0Joseph Gordon-Levitt2.35250000000.0106759ColorPG-13USA322000.0Christopher Nolan164.00.0Action|Thriller448130642.08.5English164000http://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_1The Dark Knight Rises813.02701.01144337deception|imprisonment|lawlessness|police officer|terrorist plot2012.0
4640.0Daryl Sabara632.0Samantha Morton530.0Polly Walker2.35263700000.01873ColorPG-13USA5475.0Andrew Stanton132.01.0Action|Adventure|Sci-Fi73058679.06.6English24000http://www.imdb.com/title/tt0401729/?ref_=fn_tt_tt_1John Carter462.0738.0212204alien|american civil war|male nipple|mars|princess2012.0
524000.0J.K. Simmons11000.0James Franco4000.0Kirsten Dunst2.35258000000.046055ColorPG-13USA60.0Sam Raimi156.00.0Action|Adventure|Romance336530303.06.2English0http://www.imdb.com/title/tt0413300/?ref_=fn_tt_tt_1Spider-Man 3392.01902.0383056sandman|spider man|symbiote|venom|villain2007.0
6799.0Brad Garrett553.0Donna Murphy284.0M.C. Gainey1.85260000000.02036ColorPGUSA715.0Nathan Greno100.01.0Adventure|Animation|Comedy|Family|Fantasy|Musical|Romance200807262.07.8English29000http://www.imdb.com/title/tt0398286/?ref_=fn_tt_tt_1Tangled324.0387.029481017th century|based on fairy tale|disney|flower|tower2010.0
726000.0Chris Hemsworth21000.0Robert Downey Jr.19000.0Scarlett Johansson2.35250000000.092000ColorPG-13USA80.0Joss Whedon141.04.0Action|Adventure|Sci-Fi458991599.07.5English118000http://www.imdb.com/title/tt2395427/?ref_=fn_tt_tt_1Avengers: Age of Ultron635.01117.0462669artificial intelligence|based on comic book|captain america|marvel cinematic universe|superhero2015.0
825000.0Alan Rickman11000.0Daniel Radcliffe10000.0Rupert Grint2.35250000000.058753ColorPGUK9282.0David Yates153.03.0Adventure|Family|Fantasy|Mystery301956980.07.5English10000http://www.imdb.com/title/tt0417741/?ref_=fn_tt_tt_1Harry Potter and the Half-Blood Prince375.0973.0321795blood|book|love|potion|professor2009.0
915000.0Henry Cavill4000.0Lauren Cohan2000.0Alan D. Purwin2.35250000000.024450ColorPG-13USA100.0Zack Snyder183.00.0Action|Adventure|Sci-Fi330249062.06.9English197000http://www.imdb.com/title/tt2975590/?ref_=fn_tt_tt_1Batman v Superman: Dawn of Justice673.03018.0371639based on comic book|batman|sequel to a reboot|superhero|superman2016.0

Last rows

actor_1_facebook_likesactor_1_nameactor_2_facebook_likesactor_2_nameactor_3_facebook_likesactor_3_nameaspect_ratiobudgetcast_total_facebook_likescolorcontent_ratingcountrydf_indexdirector_facebook_likesdirector_namedurationfacenumber_in_postergenresgrossimdb_scorelanguagemovie_facebook_likesmovie_imdb_linkmovie_titlenum_critic_for_reviewsnum_user_for_reviewsnum_voted_usersplot_keywordstitle_year
4431830.0Mark Duplass224.0Katie Aselton10.0Bari Hyman2.1041315000.01064ColorRUSA5021157.0Jay Duplass85.00.0Comedy|Drama|Romance1.924670e+056.6English297http://www.imdb.com/title/tt0436689/?ref_=fn_tt_tt_1The Puffy Chair51.071.04067birthday|gift|motel|new york city|upholsterer2005.0
4432407.0Sean Whalen91.0Jason Trost86.0Nick Principe2.3500020000.0674ColorUnratedUSA502491.0Jason Trost78.00.0Sci-Fi|Thriller4.764451e+074.0English835http://www.imdb.com/title/tt1836212/?ref_=fn_tt_tt_1All Superheroes Must Die42.035.01771arch villain|game of death|kidnapping|superhero2011.0
4433462.0Divine143.0Mink Stole105.0Edith Massey1.3700010000.0760ColorNC-17USA50250.0John Waters108.02.0Comedy|Crime|Horror1.804830e+056.1English0http://www.imdb.com/title/tt0069089/?ref_=fn_tt_tt_1Pink Flamingos73.0183.016792absurd humor|egg|gross out humor|lesbian|sex1972.0
4434576.0Maggie Cheung133.0Béatrice Dalle45.0Don McKellar2.350004500.0776ColorRFrance5026107.0Olivier Assayas110.01.0Drama|Music|Romance1.360070e+056.9French171http://www.imdb.com/title/tt0388838/?ref_=fn_tt_tt_1Clean81.039.03924jail|junkie|money|motel|singer2004.0
44355.0Fereshteh Sadre Orafaiy0.0Nargess Mamizadeh0.0Mojgan Faramarzi1.8500010000.05ColorNot RatedIran5027397.0Jafar Panahi90.00.0Drama6.737800e+057.5Persian697http://www.imdb.com/title/tt0255094/?ref_=fn_tt_tt_1The Circle64.026.04555abortion|bus|hospital|prison|prostitution2000.0
4436291.0Shane Carruth45.0David Sullivan8.0Casey Gooden1.850007000.0368ColorPG-13USA5033291.0Shane Carruth77.00.0Drama|Sci-Fi|Thriller4.247600e+057.0English19000http://www.imdb.com/title/tt0390384/?ref_=fn_tt_tt_1Primer143.0371.072639changing the future|independent film|invention|nonlinear timeline|time travel2004.0
44370.0Ian Gamazon0.0Edgar Tancangco0.0Quynn Ton2.104137000.00ColorNot RatedPhilippines50340.0Neill Dela Llana80.00.0Thriller7.007100e+046.3English74http://www.imdb.com/title/tt0428303/?ref_=fn_tt_tt_1Cavite35.035.0589jihad|mindanao|philippines|security guard|squatter2005.0
4438121.0Carlos Gallardo20.0Peter Marquardt6.0Consuelo Gómez1.370007000.0147ColorRUSA50350.0Robert Rodriguez81.00.0Action|Crime|Drama|Romance|Thriller2.040920e+066.9Spanish0http://www.imdb.com/title/tt0104815/?ref_=fn_tt_tt_1El Mariachi56.0130.052055assassin|death|guitar|gun|mariachi1992.0
4439296.0Kerry Bishé205.0Caitlin FitzGerald133.0Daniella Pineda2.104139000.0690ColorNot RatedUSA50370.0Edward Burns95.01.0Comedy|Drama4.584000e+036.4English413http://www.imdb.com/title/tt1880418/?ref_=fn_tt_tt_1Newlyweds14.014.01338written and directed by cast member2011.0
444086.0John August23.0Brian Herzlinger16.0Jon Gunn1.850001100.0163ColorPGUSA504216.0Jon Gunn90.00.0Documentary8.522200e+046.6English456http://www.imdb.com/title/tt0378407/?ref_=fn_tt_tt_1My Date with Drew43.084.04285actress name in title|crush|date|four word title|video camera2004.0